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This invention was made in the course of research supported in 
part by grants from the National Institutes of Health (NIH HD 16910, 
HD 04466, and HD 18968). 

TECHNICAL FIELD OF THE INVENTION 

This invention relates to recombinant DNA molecules and th el r 
expression products. More specifically this invention relates to 
recombinant DNA molecules coding for androgen receptor protein, 
androgen receptor protein, and use of the DNA molecules and protein 
in investigatory, diagnostic and therapeutic applications. 


BACKGROUND OF THE INVENTION 

The naturally occurring androgenic hormones, testosterone and 
its 5 -reduced metabolite, dihydrotestosterone, are synthesized by 
the Leydig cells of the testes and circulate throughout the body 
where they diffuse into cells and bind to the androgen receptor 
protein ("AR") . Androgens, acting through their receptor, stimulate 
development of the male genitalia and accessory sex glands in the 
fetus, virilization and growth in the pubertal male, and maintenance 
of male virility and reproductive function in the adult. The 
25 androgen receptor, together with other steroid hormone receptors 
constitute a family of trans-acting transcriptional regulatory 
proteins that control gene transcription through interactions with 
specific gene sequences. 

When prostate cancer is found to be confined to the prostate 
gland, the treatment of choice is surgical removal. However, 50 to 
80% of prostate cancer patients already have metastases at the time 
of diagnosis. Most of their tumors (70 to 80%) respond to the 
removal of androgen by castration or by suppression of luteinizing 
hormone secretion by the pituitary gland using a gonadotropin 
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releasing hormone analogue alone or in combination with an 
ant i- androgen. The degree and duration of response to this 
treatment is highly variable (10% live < 6 months, 50% live < 3 
years, : and 10% live > 10 years.) Initially cancer cells regress 
5 without androgen stimulation, but ultimately the growth of androgen 
independent tumor cells continues (35), At present it is not 
possible to predict on an individual basis which patient will 
respond to hormonal therapy and for how long. If poorly responsive 
patients could be identified early, they could be treated by 

10 alternative forms of therapy (e.g. chemotherapy) at an earlier stage 
when they might be more likely to respond. 

Studies on androgen receptors in prostate cancer have suggested 
that a positive correlation may exist between the presence of 
androgen receptors in cancer cells and their dependence on 

15 androgenic hormone stimulation for growth. (An analogous situation 
exists in mammary carcinoma where there is a correlation between 
estrogen receptors and regression of the tumor in response to 
estrogen withdrawal). However, methodological problems in the 
measurement of androgen receptors have prevented the routine use of 

20 androgen receptor assays in the diagnostic evaluation of prostate 

cancer. Prior to our preparation of androgen receptor antibodies, 

all androgen receptor assays were based on the binding of 
3 

[ H]-labeled androgen. These assays have been unreliable in human 

prostate cancer tissue because of the extreme lability of the 

25 androgen binding site and the presence of unlabeled androgen in the 

tissue. Endogenous androgen occupies the binding site on the 

receptor and dissociates very slowly (t 1/2 24-48 hr at 0C). A 

further problem is that biopsy samples are quite small, making it 

3 

difficult to obtain sufficient tissue for [ H]~androgen binding 
30 assays. Moreover, prostate cancer is heterogenous with respect to 
cell types. Thus within a single biopsy sample there is likely to 
be an uneven distribution of cells containing androgen receptors. 

Development of the male phenotype and maturation of male 
reproductive function are dependent on the interaction of androgenic 
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hormones with the androgen receptor protein and the subsequent 
function of the receptor as a transacting Inducer of gene 
expression. It has become well established over the past 
twenty-five years that genetic defects of the androgen receptor 
result in a broad spectrum of developmental and functional 
abnormalities ranging from genetic males (46.XY) with female 
phenotype to phenotypically normal males with infertility. 

atfon of the structural gene for the androgen receptor makes it 
possible to define the nature of these genomic defects in molecular 
terms. Analysis of the functional correlates of the genetic defects 
ra ay lead to a better understanding of the regulation of androgen 
receptor gene expression and of the mechanism of androgen action in 
male sexual development and function. 

The androgen insensitivity syndrome, known also as testicular 
feminization, is characterized by an inability to respond to 
androgen due to a defect in the androgen receptor, the protein that 
mediates the action of androgen within the cell. Androgen 
insensitivity is an inherited X-linked trait that occurs in bat 
complete and incomplete forms. The complete form resu f-il-r. 
of male sex differentiation during embryogenesi s and absence of 
virilization at puberty. The result is a 46.XY genetic male wit 
testes and male internal ducts. The testes produce normal amounts 
of testosterone and Mullerian inhibiting substance. Consequently 
development of the uterus is inhibited as in the normal male 
Because of the inability to respond to androgen, the external 
genitalia remain in the female phenotype with normal clitoris and 
abia. A small vagina develops from the urogenital sinus and ends 
in a blind pouch. At puberty feminization with breast development 
and female contours occur in response to testicular 
However, there is no growth of sexual hair even t^r, 
testosterone concentrations are equal to or greater than levels in 

the normal male. 4 . 

incomplete forms of the androgen insensitivity syndrome include 

a spectrum of phenotypes resulting from varying degrees of 
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incomplete androgen responsiveness. At one extreme, individuals 
have mild enlargement of the. clitoris and sparse pubic hair* The 
opposite extreme is characterized by more complete masculinization 
with varying degrees of hypospadias deformity but predominantly a 
5 male phenotype. It has been reported that some adult men with 
severe oligospermia or azoospermia who are otherwise normal, have 
defects in the androgen receptor. These may include as many as 10% 
of infertile males. 

The genetic defect eliciting this range of abnormalities is . 

10 thought to be a single biochemical event at the level of the gene 

for the androgen receptor. The androgen receptor is a high affinity 
androgen binding protein that mediates the effects of testosterone 
and dihydrotestosterone by functioning as a trans-acting inducer of 
gene expression. For proper male sexual development to occur, there 

15 is a requirement for androgen and its receptor at a critical time 
during embryogenesi s and during puberty. The majority of 
individuals with the androgen insensitivity syndrome have a history 
of affected family members, although about a third are thought to 
represent new mutations of this X-l inked disorder. The incidence 

20' ranges from 1 in 20,000 to 60,000 male births. 

In studies of families with clinical evidence of the androgen 
- insensitivity syndrome, four main categories were recognized that 
range from the most severe, complete absence of receptor binding 
activity in a genetic male with female phenotype, to qualitatively 

25 normal receptor in affected individuals. Second in severity are 

affected individuals with qualitatively abnormal androgen binding by 
receptor present in normal levels. Examples include the failure of 
sodium molybdate (a reagent often used in studies on steroid 
receptors) to stabilize the receptor of affected individuals when 

30 molybdate is known to stabilize the wild-type receptor. Lability of 
the receptor under conditions that normally cause transformation has 
also been reported. A third group expresses a decreased amount of 
receptor with wild-type in vitro binding characteristics-. The final 
grouping contains those androgen insensitivity patients in 
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whom no abnormality in receptor is detected. In a recent study of 
this form of the syndrome, the androgen receptor was as capable of 
binding oligonucleotides as the wild-type receptor. Indeed, with 
the techniques available until only recently, it has been difficult 
5 in certain cases to document an androgen receptor defect in affected 
individuals. 

Experimental methods used in assessing receptor defects in the 
past have relied on the ability of receptor to bind androgen with 
high affinity. The limitation of this methodology is that it is not 

10 possible to distinguish between the lack of expression of the 

receptor and loss of androgen binding activity. An example of how 
inadequate methodology complicates diagnosis is the absence of 
detectable receptor binding activity in patients who are partially 
virilized. It is theoretically possible for a mutation to occur 

15 which allows the receptor with defective androgen binding activity 
to induce gene transcription. Biologically active truncated forms 
of the glucocorticoid receptor that lack steroid binding activity 
but retain the DNA binding domain have been demonstrated using 
genetically engineered mutants. 

20 Purification of the androgen receptor has been difficult to 

accomplish due to its low concentration and high degree of 
instability. Reported attempts at purification using either 
conventional methods of column chromatography or steroid-affinity 
chromatography have yielded insufficient amounts of receptor protein 

25 to allow even the preparation of monoclonal antibodies. 

An early report on the partial purification of the androgen 
receptor was disclosed by Mainwaring et al. in "The use of DNA - 
cellulose chromatography and isoelectric focusing for the 
characterization and partial purification of steroid-receptor 

30 complexes," Biochem J, 134, 113-127 (1973). They used DNA-cellulose 
chromatography and isoelectric focusing to isolate the receptor from 
rat ventral prostate and determined its physiochemical properties. 
This group was among the first to attempt the use of steroid 
affinity chromatography in conjunction with conventional 
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chromatography, using the affinity label 
17B-bromoacetoxytestosterone in receptor purification (See 
Mainwaring et al., "Use of the affinity label 
17B-bromoacetoxytestosterone in the purification of androgen 

5 receptor proteins," Perspectives in Steroid Receptor Research, 
(1980)). Partial purification of androgen receptor has also been 
attempted from other tissue sources, such as ram seminal vesicles 
(See Foe kens et aU, Molecular Cellular Endocr, 23, 173-186 (1981) 
and Foekens et al,, "Purification of the androgen receptor of sheep 

10 seminal vesicles," Biochem Biophys Res Comm, 104, 1279-1286 

(1982)). The partially purified receptor displayed characteristics 
of a proteolyzed receptor, but a purification of 2,000 fold was 
reported with a recovery of 33% (See Foekens et al., "Purification 
of the androgen receptor of sheep seminal vesicles," Biochem Biophys 

15 Res Comm, 104, 1279-1286 (1982)). Later attempts at purification 
continued to combine steroid affinity chromatography with 
conventional techniques, reportedly achieving significant 
purification," but recoveries too low for further analysis (See Chang 
et al., "Purification and characterization of androgen receptor from 

20 -steer semenal vesicle," Biochemistry 21, 4102-4109 (1982), Chang et 
al., "Purification and characterization of the androgen receptor 
from rat ventral prostate," Biochemistry 22, 6170-6175 (1983) and 
Chang et al., "Affinity labeling of the androgen receptor in rat 

prostate cytosol with 

25 l7B-C(bromoacetyl)oxy]-5-alpha-androstan-3-one," Biochemistry 23, 

2527-2533 (1984)). More recent studies examine the effectiveness of 
a variety of immobilized androgens for their ability to bind the 
androgen receptor (See De Larminat et al., "Synthesis and evaluation 
of immobilized androgens for affinity chromatography in the 

30 purification of nuclear androgen receptor," The Prostate 5., 123-140 
(1984) and Bruchovsky et al, "Chemical demonstration of nuclear 
androgen receptor following affinity chromatography with immobilized 
ligands," The Prostate 10, 207-222 (1987)). Despite these efforts, 
the receptor has not been purified to homogeneity and 
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the quantities of purified androgen receptor obtained have been 
insufficient for the production of antisera. 

Clinical assays for the androgen receptor now include several 
methods. The most common is the binding of tritium-labeled hormone 
5 and measurement of binding using a charcoal adsorption assay. 

Either a natural androgen, such as dihydrotestosterone, or synthetic 
androgen, such as mibolerone or methyl trienol one (R1881), can be 
used. An advantage of the latter in human tissue is that it is not 
significantly metabolized and does not bind to the serum androgen 

10 binding protein, sex steroid binding globulin. A limitation of 
radioisotope labeling of receptor is interference caused by 
endogenous androgen. Although exchange assays for the androgen 
receptor have been described (See Carroll et al., J Steroid Biochem 
21, 353-359 (1984) and Traish et al., J Steroid Biochem 23, 405-413 

15 (1985)), their effectiveness is limited by the slow kinetics of 
dissociation of the endogenous receptor-bound androgen. 

Another method used to assess receptor status is 
autoradiography. In this method disclosed in Barrack et al., 
"Current concepts and approaches to the study of prostate cancer," 

20 Progress in Clinical and Biological Research, 239, 155-187 (1987) 
the radioactively labeled androgen is Incubated with slide-mounted 
tissue sections of small tissue biopsy specimens which are then 
frozen, sectioned and fixed. Nuclear localization of radioactivity 
is detected by exposure of tissue sections to x-ray film. This 

25 technique requires considerable technical expertise, is labor 

intensive, and requires extended periods of exposure time. It is 
therefore of limited usefulness in general clinical assays. Another 
problem is high levels of background signal, i.e. a high 
noise/signal ratio, making it difficult to distinguish 

30 receptor-bound nuclear radioactivity from unbound radioactivity 
distributed throughout the cells. 

WO 87/05049 (Shine) discloses a method for the production of 
purified steroid receptor proteins, specifically estrogen receptor 
proteins, through the expression of recombinant DNA encoding for 
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such proteins in eukaryotic host cells. However, the reference does 
not disclose the sequence for androgen receptor protein, nor does it 
disclose a method for obtaining such a sequence, 

5 SUMMARY OF THE INVENTION 

The present invention provides a DNA sequence characterized by 
a structural gene coding for a polypeptide having substantially the 
same biological activity as androgen receptor protein. A DNA 
sequence encoding androgen receptor protein or a protein having 

10 substantially the same biological activity as androgen receptor 

activity is also provided. DNA sequences may be obtained from cDNA 
or genomic DNA, or prepared using DNA synthesis techniques. 

The invention further discloses cloning vehicles comprising a 
DNA sequence comprising a structural gene encoding a polypeptide 

15 having substantially the same biological activity as androgen 
receptor protein. Cloning vehicles comprising a DNA sequence 
encoding androgen receptor protein or a protein having substantially 
the same biological activity as androgen receptor protein is also 
provided. The cloning vehicles further comprise a promoter sequence 

20 upstream of and operatively linked to the DNA sequence. In general 
the cloning vehicles will also contain a selectable marker, and, 
depending on the host cell used, may contain such elements as 
regulatory sequences, polyadenylation signals, enhancers and RNA 
splice sites. 

25 The invention further provides cells transfected or transformed 

to produce androgen receptor protein or a protein having 
substantially the same biological activity as androgen receptor 
protein, 

A further aspect of the invention provides a purified androgen 
30 receptor protein and purified polypeptides and proteins having 
substantially the same biological activity as androgen receptor 
activity, and methods for producing such proteins and polypeptides. 
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B RIEF DESCRIPTION OF THE DRAWINGS 

FiguPe i shows a comparison of DNA-binding domains of the human 

androgen receptor (MR) with members of the nuclear receptor 
family. (A) is a comparison of oligo A nucleotide sequence with 
sequences of hAR and other nuclear receptors: hPR, human 
progesterone receptor; hMR, human mineralocorticoid receptor; hGR, 
human glucocorticoid receptor; hER, human estrogen receptor; hT3R, 
human thyroid hormone receptor; hRAR, human retinoic acid receptor. 
Chromosomal locations are shown in parentheses at the left. 
Nucleotide identity between oligo A and hAR is indicated with an 
asterisk. The percent homology with oligo A is in parentheses at 
the right of each sequence. (B) shows the the structure of 
fibroblast clone ARHFL1 human fibroblast clone. [1]). Nucleotide 
residues are numbered from the 5'-terminus. Restriction 
endonuclease sites were determined by mapping or were deduced from 
DMA sequence. The TGA translation termination codon, determined by 
comparison with hPR, hMR and hGR, follows a long open reading frame 
containing sequences homologous to those of other steroid 
receptors. Arrows indicate exon boundaries in genomic clone X05AR. 
The hatched area is the putative DMA binding domain. (C) shows a 
comparison of amino acid sequences of the AR DNA-binding domain with 
sequences of the nuclear receptor family. AR amino acid sequence 
was deduced from nucleotide sequence of clone ARHFL1 and is numbered 
beginning with the first conserved cysteine residue (♦). Amino acid 
numbers in parentheses at the left indicate the residue number of 
the first conserved cysteine from the references indicated above. 
Percent homology with hAR is indicated -in parentheses on the right. 
The region of the DNA-binding domain from which the oligo A sequence 
was derived 1s underlined in hAR. Coding DNA of residues 1 to 31 is 
contained within genomic clone X05AR. Abbreviations in addition to 
those described above are cVDR, chicken vitamin D receptor, and 
VERBA, erb A protein from avian erythroblastosis virus. 
Abbreviations for amino acid residues are: 
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A, Ala; C» Cys; D, Asp; E, Glu; F, Phe; G» Gly; H, His; I, lie; K, 
Lys; L, Leu; M, Met; N s Asn; P 9 Pro; Q, Gin; R 9 Arg; S, Ser; T, Thr; 
V, Val; W, Trp; and Y, Tyr. 

Figure 2 illustrates the steroid binding properties of 

5 expressed AR cDNA. (A) shows the structure of pCMVAR in the 

expression vector pCMV containing the human cytomegalovirus(CMV) 
promoter of the immediate early gene, poly (A) addition-transcription 
terminator region of the human growth hormone gene (hGH poly A),SV40 
origin of replication (SV40 Ori), and a polylinker region for 

10 insertion of cDNAs. The plasmid pTEBR contains the ampicillin 
resistance gene (Amp). (B) shows saturation analysis of 
[ 3 H]dihydrotestosterone binding in extracts of pCMVAR transfection 
of COS M6 cells. Portions of cytosol (0.1 ml, 0.3 mg/ml protein) 
were incubated overnight at 4°C with increasing concentrations of 

15 3 H-labeled hormone and analyzed by charcoal adsorption. 

Nonspecific binding increased from 18% to 37% of total bound 

radioactivity. (C) shows a scratched plot analysis of 

[ H]dihydrotestosterone binding. Error estimation was based on 

linear regression analysis (r=0.966). (U) illustrates the 

20 competition of unlabeled steroids for binding of 5 nM 
3 

C H]dihydrotestosterone in transfected COS W6 cell extracts. 
Unlabeled steroids were added at 10- and 100-fold excess of labeled 
hormone. Specific binding was determined as previously described. 
Figure 3 is a compiled clone map of the human androgen 

25 receptor. The map shows the structure of the human androgen 
receptor gene and the relative positions of the nucleic acid 
sequences contained in the cDNA probes [A], [BL [C] and [D], human 
fibroblast clone [1], human epididymis clones [1] and [5], human 
genomic clones [1], [2], [3], [4] and [5], and rat epididymis clones 

3U [1] and [2]. 

Figure 4 shows the complete nucleotide sequence for human 
androgen receptor cDNA and the deduced amino acid sequence. 
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Figure 5 shows the complete nucleotide sequence of the rat 
androgen receptor cDNA and the predicted amino acid sequence. 

Figure 6 is a photograph of a frozen section of rat ventral 
prostate stained with antibodies (AR-52-3-p) to the AR peptide 
b NH 2 -Asp-Hi s-Val -Leu-Pro-Ile-Asp-Tyr-Tyr-Phe-Pro-Pro-Gl n-Lys-Thr i n 
a dilution of 1 to 3000 using the av1d1n-b1otin peroxidase 
technique. Androgen receptor is indicated by brown staining of 
nuclei in epithelial cells. 

Figure 7 is a photograph showing restriction fragment length 
10 polymorphisms in the human androgen receptor gene. 

Figure 8 is a photograph showing a Southern blot analysis in 
the human androgen receptor gene in complete androgen insensiti vity 
syndrome patients. 

15 ( DETAILED DESCRIPTION OF THE INVENTION 

? In the description the following terms are employed: 

Nucleotide 

A monomeric unit of DNA or RNA consisting of a sugar moiety 
(pentose) , a phosphate,, and a nitrogenous heterocyclic base. The 

20 base is linked to the sugar moiety via the glycosidic carbon {V 
carbon of the pentose) and that combination of base and sugar is a 
nucleotide. The base characterizes the nucleotide. The four DNA 
bases are adenine ("A"), guanine ("6"), cytosine ("C") and thymine 
("T"). The four RNA bases are A, G, C and uracil ("U"). A and G 

25 are purines, abbreviated to R, and C, T, and U are pyrimidines, 
abbreviated to Y. 
DNA Sequence 

A linear series of nucleotides connected one to the other by 
phosphodi ester bonds between the 3' and 5' carbons of adjacent 
30 pentoses. 
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Codon 

A DNA sequence of three nucleotides (a triplet) which encodes 
through mRNA an amino acid, a trans! ational start signal or a 
translational termination signal. For example, the nucleotide 
5 triplets TTA, TTG, CTT, CTC, CTA and CTG encode for the amino acid 
leucine ("Leu"), TAG, TAA and TGA are translational stop signals and 
ATG is a translational start signal. 

Reading Frame 

The grouping of codon 's during translation of mRNA into amino 
10 acid sequences. During translation the proper reading frame must be 
maintained. For example, the sequence GCTGGTTGTAAG may be 
translated in three reading frames or phases, each of which affords 
a different amino acid sequence: 

GCT GGT TGT AAG - Ala-Gly-Cys-Lys 
15 6 CTG GTT GTA AG - Leu-Val-Val 

GC TGG TTG TAA A - Trp-Leu-(STOP) 
Polypeptide 

A linear series of amino acids connected one to the other by 
peptide bonds between the a-amino and carboxy groups of adjacent 
20 amino acids. 

Genome 

The entire DNA of a substance. It includes inter alia the 
structural genes encoding for the polypeptides of the substance, as 
well as operator, promoter and ribosome binding and interaction 
25 sequences including sequences such as the Shine-Dalgarno sequences. 
Structural Gene 

A DNA sequence which encodes through its template or messenger 
RNA ("mRNA") a sequence of amino acids characteristic of a specific 
polypeptide o 
30 Transcription 

The process of producing mRNA from a structural gene. 

Translation 

The process of producing a polypeptide from mRNA. 
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Expression 

The process undergone by a structural gene to produce a 
polypeptide. It is a combination of transcription and translation. 
Plasmid 

5 A non-chromosomal double-stranded DMA sequence comprising an 

intact "replicon" such that the plasmid is replicated in a host 
cell. When the plasmid is placed within a unicellular organism, the 
characteristics of that organism are changed or transformed as a 
result of the DNA of the plasmid. For example, a plasmid carrying 

10 the gene for tetracycline resistance (Tet R ) transforms a cell 

previously sensitive to tetracycline into one which is resistant to 
it. A cell transformed by a plasmid is called a "transf ormant" . 
Phage or Bacteriophage 

Bacterial virus many of which include DNA sequences 
lb encapsidated in a protein envelope or coat ("capsid"). In a 

unicellular organism a phage may be introduced as free DNA by a ■ ■ 
process called transfection. 
Cloning Vehicle 

A plasmid, phage DNA or other DNA sequences which are able to 
20 replicate in a host cell, characterized by one or a small number of 
endonuclease recognition sites at which such DNA sequences may be 
cut in a determinable fashion without attendant loss of an essential 
biological function of the DNA, e.g., replication, production of 
coat proteins or loss of promoter or binding sites, and which 
25 contain a marker suitable for use in the identification of 

transformed'cells, e.g., tetracycline resistance or ampicillin 
resistance. A cloning vehicle is often called a vector. 
Cloning 

The selection and propagation of a single species. 
30 Recombinant DNA Molecule 

A hybrid DNA sequence comprising at least two nucleotide 
sequences, the first sequence not normally being found together in 
nature with the second. 
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Expression Control Sequence 

A DNA sequence of nucleotides that controls and regulates 
expression of structural genes when operatively linked to those x 
genes . 

b To attain the objects of this invention it was necessary to 

determine the amino acid sequence and the DNA sequence of the 
structural gene encoding androgen receptor protein. One 
conventional approach would involve starting with the purified 
androgen receptor protein. However, as described above, significant 

10 amounts of the protein for such purposes have not been obtained. 
An alternative approach to circumvent the overwhelming 
difficulty of androgen receptor protein purification is direct 
isolation of the DNA encoding the messenger RNA for androgen 
receptor protein. 

15 Our strategy for isolating AR DNA was based on evidence that 

the AR. gene is X-linked and that no other steroid receptor gene is 
located on the X chromosome. Sequence data are available from cDNAs 
for glucocorticoid , estrogen, progesterone, mineralocorticoid and 
vitamin 0 receptors. Comparison of the derived amino acid sequences 

20 has revealed a central region of high cysteine content which was 
found also in the v-erb A oncogene -product recently identified as 
the thyroid hormone receptor. Within this 61-63 amino acid region 
is an arrangement of 9 cysteine residues that are absolutely, 
conserved among steroid receptors thus far characterized. The 

25 overall homology among sequences in this conserved region ranges 

between 40 and 90%. We assumed that AR would resemble other members 
of the steroid receptor family in the conserved DNA-binding domain. 

A human X chromosomal library was screened with the synthetic 
oligo nucleotide probe A (Oligo A sequence = CTT TTG AAG AAG 
30 ACC TTA CAG CCC TCA CAG GT 3 ) of Figure 1 (A) designed as a 

consensus sequence from the conserved sequence of the DNA-binding 
domain of other steroid receptors. Screening the library with the 
oligo A probe resulted in several recombinants whose inserts were 
cloned into bacteriophage M13 DNA and sequenced. One recombinant 
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clone (Charon 35 X05AR) (human genomic clone [1]) contained a 
sequence similar to, yet distinct from, the DMA-binding domains of 
other steroid receptors. It had 84% sequence identity with oligo A, 
while other receptor DNAs were 78% to 91% homologous with the 

5 consensus oligonucleotide. 

From the nucleotide sequence just 5' of the DNA binding domain, 
oligonucleotide probe B (Oligo B sequence = 5 G6A CCA TGT TTT GCC 
CAT TGA CTA TTA CTT TCC ACC CC 3 ) was synthesized and used to 
screen bacteriophage lambda gtll cDNA libraries from human 

1Q epididymis and cultured human foreskin fibroblasts. Recombinant 
phage (unamplified) screened with this oligonucleotide by in situ 
hybridization revealed one positive clone in each library. The 
epldidymal clone (gtll ARHEL1 ) (human epididymis clone [1]) contained 
the complete DNA-binding domain and approximately 1.5 kb of upstream 

15 sequence, whereas the fibroblast clone (gtll ARHF 1,1 )( human 

fibroblast clone [1]) shown in Figure 1(B) contained the DNA-binding 
domain and 1.5 kb of downstream sequence. The DNA-binding domains 
of the cDNA isolates were identical to that of the genomic exon 
sequence. 

20 Transient expression in monkey kidney cells (COS M6) 

demonstrated that the human foreskin fibroblast cDNA fragment 
encodes the steroid-binding domain of hAR. A DNA fragment 
(ARHFLIH-X) extending 5' to 3' from the H1nd III site within the 
putative DNA-binding domain through the stop codon (TGA) was cloned 

2b into pCMV as shown in Figure 2(A). Expression was facilitated by 
adding to the 5' end a consensus translation initiation sequence 
containing the methionine codon (ATG) in reading frame. 
Transfection of the recombinant construct produced a protein with 
high-affinity for [ 3 H]dihydrotestosterone, Figure 2(C) saturable 

3U at physiological levels of hormone. See Figure 2(B). The binding 
constant [K d - 2.7 (+ 1.4) x 1Q~ 1Q *) was nearly identical to 
that of native AR. The level of expressed protein, 1.3 pmol per 
milligram of protein, was 20 to 60 times greater than that in male 
reproductive tissues. Mock transfections without plasmid or 
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transfections* with plasmid DNA lacking th AR insert yielded no 
specific binding of dihydrotestosterone. Figure 2(D) shows steroid 
specificity was identical to .that of native AR, with highest 
affinity for dihydrotestosterone and testosterone, intermediate 
5 affinity for progesterone and estradiol, and low affinity for 
Cortisol. 

Figure 3 is a clone map compiled to show the human androgen 
receptor gene and the nucleic acid sequences in the cDNA clones, 
human genomic clones, human fibroblast clones, human epididymis 

1U clones, -and rat epididymis clones. Human fibroblast clone [1] 
extended through the stop codon or the C-terminal end of the 
androgen receptor protein. To isolate and elucidate the sequence of 
the 5' or N-terminal end of the androgen receptor protein, we used a 
EcoRl/Sstl fragment (EcoRl site was from the linker) from the 5 1 end 

15 of human epididymis clone [1] as a probe (cDNA probe [A]), to 

rescreen the human X chromosomal library by standard techniques. By 
these techniques, human genomic clone [2] was isolated and in turn 
used as a probe to rescreen a human epididymis library and isolate 
human epididymis clone [5]. The N-terminal sequence was elucidated 

20 along with the 5 1 flanking sequence of the androgen receptor protein 
and gene. Human genomic clones [3], £4] and [5] for the sequence 3 l 
of human genomic clone [1] were obtained using cDNA probes B [a Hind 
III/EcoRl fragment] and C [an EcoRl fragment], by screening and 
isolating by standard techniques. 

25 Two rat clones, rat epididymis clones [1] and [2], were 

isolated from a rat epididymis cDNA library using as probes the 
complete human epididymis clone [1] and a EcoRl /Pstl fragment, cDNA 
probe [D], respectively. These rat clones contained the entire 
protein coding sequence for the rat androgen receptor, plus flanking 

30 5 1 and 3 1 untranslated sequences which were used to confirm the 
sequence of the human androgen receptor. 

The complete double-stranded sequence encoding the human 
androgen receptor protein was determined and ttre deduced amino acid 
sequence of the human androgen receptor protein is set forth 
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in Figure 4. The cDNA sequence and the amino acid sequence for 
the rat androgen receptor protein is set forth in Figure 5. 

Recombinant DNA clones human fibroblast clone [1] isolated from 
human foreskin fibroblast cDNA gtll expression library, human 

5 epididymis clones [1] and [5] isolated from human epididymis cDNA 
gtll expression library were deposited in the American Type Culture 
Collection with accession numbers ATCC # 40439, ATCC t 40442 and 
ATCC # 40440 respectively. Human genomic clones [1], [2], [3], [4] 
and [5] which were isolated from human X chromosome lambda Charon 35 

10 library available as ATCC # 57750 have been deposited with the 
American Type Culture Collection with accession numbers 
ATCC # 40441, ATCC # 40443, ATCC #40444, ATCC # 40445 and 
ATCC # 40446 respectively. 

A wide variety of host=cloning vehicle combinations may be 

15 usefully employed in cloning the double stranded DNA disclosed 
herein. For example, useful cloning vehicles may include 
chroomosomal , non-chromosomal and synthetic DNA sequences such as 
various known bacterial plasmids and wider host range plasmids such 
as pCMV and vectors derived from combinations of plasmids and phage 

20 UNA such as plasmids which have been modified to employ phage DNA 
expression control sequences. Useful hosts may include bacterial 
hosts, yeasts and other fungi, animal or plant hosts, such as 
Chinese Hamster Ovary cells (CHO, or monkey kidney cells (COS M6), 
and other hosts. The particular selection of host-cloning vehicle 

25 combinations may be made by those of skill in the art after due 

consideration of factors such as the source of the DNA- i.e. genomic 
or cDNA. 

Cloning vehicles for use in carrying out the present invention 
will further comprise a promoter operably linked to the DNA sequence 
30 encoding the androgen receptor protein. In some instances it is 
preferred that cloning vehicles further comprise an origin of 
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replication , as well as sequences which regulate and/or enhance 
expression levels, depending on the host cell selected. 

Techniques for transforming hosts and expressing foreign cloned 
in them are well known in the art (See, for example, Maniatis et 
5 al., infra). Cloning vehicles used for expressing foreign genes in 
bacterial hosts will generally contain a selectable marker, such as 
a gene for antibiotic resistance, and a promoter which functions in 
the host cell. 

Eukaryotic microorganisms ^ such as the yeast Saccharomyces 

10 cerevisiae, may also be used as host cells. Cloning vehicles will 
generally comprise a selectable marker, such as the nutritional 
marker TRP, which allows selection in a host strain carrying a trpt 
mutation. To facilitate purification of an androgen receptor 
protein produced in a yeast transformant, a yeast gene encoding a 

15 secreted protein may be joined to the sequence encoding androgen 
receptor protein. 

Higher eukaryotic cells can also serve as host cells in 
carrying out the present invention. Cultured mammalian cells are 
preferred. Cloning vehicles for use in mammalian cells will 

20 comprise a promoter capable of directing the transcription of a 
foreign gene introduced into a mammalian cell. Also contained in 
the expression vector is a polyadenylation signal, located 
downstream of the insertion site. The polyadenylation signal can be 
that of the cloned androgen receptor gene, or may be derived from a 

25 heterologous gene. 

A selectable marker, such as a gene that confers a selectable 
phenotype, is generally introduced into the cells, along with the 
gene of interest. Preferred selectable markers include genes that 
confer resistance to drugs, such as neomycin, hygromycin and 

30 methotrexate. Selectable markers may be introduced into the cell on 
a separate plasmid at the same time as the gene of interest, or they 
may be introduced on the same plasmid. 

The copy marker of the. integrated gene sequence can be 
increased through amplification by using certain selectable 


WO 89/09791 


PCT/US89/01548 


- 19 - 


1U 


15 


20 


25 


30 


markers. Through selection, expression levels may be substantially 

increased, „ fllle 
Androgen receptor proteins may be purified from the host cells 
or cell media according to the present invention using techniques 
well known to those in the art. Such proteins may be utilized to 
produce monoclonal or polyclonal antibodies according to the 

techniques described below. 

The techniques of this invention offer considerable advances 
over existing technology for measurement of androgen receptor, 
utilizing proteins and peptides containing the disclosed sequences 
monoclonal or polyclonal antibodies can be produced for use as 
immunochemical reagents in immunodi agnostic assays. For examp e, 
radioimmunoassays and EL ISA assays can be developed utilizing these 
reagents which will allow detection and quantification of androgen 
receptor in the presence of endogenous androgen since such androgen 
will not interfere with antibody binding to the receptor. 

Immunocytochemistry utilizing our reagents enables ^ 
determination and quantification of the cellular distribution of the 
androgen receptor in tumor tissues, which are often heterogenous in 
composition. This assay offers great potential for diagnostic • 
evaluation of prostate cancer to determine to responsiveness to 

androgen withdrawal therapy. 

in addition, the antibodies produced using the disclosed amino 
acid sequences can also be used in processes for the purification of 
androgen receptor protein produced by the above methods. One such 
purification process is disclosed in Logeat, F., et aK, 
Biochemistry vol. 24 (1985), pp. 1029-1035. which is incorporated by 

reference herein. . 

Androgen receptor proteins and polypeptides synthesized from 

the deduced amino acid sequence can be used as immunogens for the 

preparation of antibodies to the androgen receptor. Peptides for 

such use range in length from about 3 to about-958 amino acids in 

length and are preferably from about 15 to about 30 amino acids in 

length. Shorter peptides may have significant sequence homology to 


WO 89/09791 


PCT/US89/01548 


- 20 - 

other steroid receptor proteins and larger peptides may contain 
multiple antigenic determinants; these properties could result in 
antibodies with cross-reactivities to other steroid receptor 
proteins. 

Peptides can be synthesized from amino acid sequences in the 
NH 2 -terminal region, the DNA-binding domain, and the 
carboxyl -terminal steroid binding domain. Peptide selection will be 
based on hydropathic plots, selecting hydrophilic regions that are 
more likely exposed on the receptor surface. For diagnostic 
purposes preferred sequences will be selected from the 
NH 2 -terminaT region where there is the least homology with other 
steroid receptor proteins. 

Peptides for use as immunogens can be synthesized using 
techniques available to one of ordinary skill in the art. For 
example, peptides corresponding to androgen receptor sequences can 
be synthesized using tBOC chemistry on a Biosearch Model 9500 
peptide synthesizer. Peptide purity is assessed by high pressure 
liquid chromatography. Peptides can be conjugated to keyhole limpet 
hemocyanin through cysteine residues using the coupling agent 
m-maleimido-benzoyl-N-hydroxysuccinimide ester. One. can also 
prepare resin-bound peptides utilizing the p-(oxymethyl benzamide) 
handle to attach the C-terminal amino acid to sol Id-phase resin 
support. 

Proteins and peptides of this invention can be utilized for the 
production of polyclonal or monoclonal antibodies. Methods for 
production of such antibodies are known to_ those of ordinary skill 
in the art and may be performed without undue experimentation. One 
method for the production of monoclonal antibodies is described in 
Kohler, G., et al., "Continuous Culture of Fused Cells Secreting 
Antibody of Predefined Specificity," Nature , vol. 256 (1975), p. 
495, which is incorporated herein by reference. Polyclonal 
antibodies, by way of example, can be produced by the method 
described below. 
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Peptide conjugates op resin-bound peptides can be injected into 
rabbits according to the procedure of Vaitukaitis et al., J Clin 
Endocrinol Metab, 33, 988-991 (1971) using a standard immunization 
schedule. Anti sera titers can be determined in the ELISA assay. 
5 For example, one androgen receptor sequence, 

NH 2 -Asp-His-Val-Leu-Pro-Ile-Asp-Tyr-Tyr-Phe-Pro-Pro-Gln-Lys-Thr 

in the 5' region upstream from the DNA-binding domain, was used to 
raise antisera in rabbits. The antisera react selectively at a 
dilution of 1 to 500 with the androgen receptor both in its 

10 untransformed 8-10S form and in its 4-5S transformed form. Receptor 
sedimentation on sucrose gradients increases from 4 to 8-10S in the 
presence of antiserum at high ionic strength and from 8-10S to 
11-12S at low ionic strength, sucrose gradients. In the ELISA 
reaction against the peptide used as immunogen, reactivity was 

15 detectable at 1 to 25,000 dilution. This antiserum at a dilution of 
1 to 3000 was found effective in staining nuclear androgen receptor 
in rat prostate and other male accessory sex glands (see Figure 6). 

Our invention provides new molecular probes comprising 
complementary DNA sequences derived from the deduced sequences 
' 20 encoding the androgen receptor for diagnostic purposes. Such probes 
may be used to detect the presence of androgen receptor mRNA in 
tumor cells. Such probes may also be used for detection of androgen 
receptor gene defects. Androgen receptor complementary DNA 
sequences can be used as hybridization probes to detect 

25 abnormalities in the androgen receptor gene or in its messenger RNA. 

Androgen receptor DNA sequences disclosed and complementary RNA 
sequences can be used to construct probes for use in DNA 
hybridization assays. An example of one such hybridization assay 
and methods for constructing probes for such assays are disclosed 

30 in U.S. Patent No. 4,683,195 to MulUs et al., U.S. Patent No. 

4,683,202 to Mullis, U.S. Patent No. 4,617,261 to Sheldon, III et 
al., U.S. Patent No. 4,683,194 to Salki et al., and U.S. Patent No. 
4,705,886 to Levenson et al., which are hereby incorporated by 
reference. 
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By example, one method for detecting gene deletion utilizes 
Southern blotting and hybridization, DNA can be isolated from 
cultured skin fibroblasts or from leukocytes obtained from blood. 
DNA is cut with restriction enzymes, electrophoresed on an agarose 
5 gel, blotted onto nitrocellulose, and hybridized with 

C P]-labeled androgen receptor DNA (see Maniatis, T. et al.» 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory, 1982,- incorporated by reference herein). 

In addition, small -mutations can be detected utilizing methods 

10 known to one of ordinary skill in the art, from cultured skin 

fibroblasts of the affected individual. A cDNA library can been 

prepared using standard techniques. The androgen receptor. clones 

32 

can be isolated using a [ PJDNA AR probe. The clones AR cDNA can 
then be sequenced and compared to normal AR cDNA sequences* 

15 Alternatively genomic DNA can be isolated from blood leukocytes 

or cultured skin fibroblasts of the affected individual. The DNA is 
then subjected to restriction enzyme digestion, electrophoresis and 
is blotted onto nitrocellulose. Synthetic oligonucleotides can be 
used to bracket specific exons. Exon sequences are amplified using 

20 the polymerase chain reaction, cloned into M13 and sequenced. The 
sequences are compared to normal human AR DNA sequences. 

Another method of identifying small mutations or deletions 
takes advantage of the ability of RNase A to cleave regions of 
single stranded RNA in RNA: DNA hybrids. Genomic DNA isolated from 

25 fibroblasts of affected individuals is hybridized with radioactive 
RNA probes (Promego Biotec) prepared from wild-type androgen 
receptor cDNA. Mismatches due to mutations would be cleaved by 
RNase A and result in altered sized bands relative to wild-type on 
denaturing polyacryl amide gels. 

30 Restriction fragment length polymorphism (RFLP) linked to the 

androgen receptor gene locus may be used in prenatal diagnosis and 
carrier detection of androgen insensitivity. For example, the 
presence of RFLPs in normal individuals is first established- by 
isolating DNA from lymphocytes of at least six females (total of 12 
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X chromosomes). DNA can be isolated using the proteinase K 
procedure and fragmented using a battery of restriction enzymes. 
Preferred are those enzymes that contain the dinucleotide sequence 
CG in their recognition sequence. Southern blots are screened with 
5-10 kb androgen receptor genomic fragments which if possible lack 
repetitive DNA. For those regions containing repetitive elements, 
total human genomic DNA can be added as competitor in the 
hybridization reaction. Alternatively, one can subclone selected 
regions to yield a probe free of repetitive elements. 

For example, a human restriction fragment length was determined 
by cDNA probe (B) and Hind III restriction endonuclease using the 
Southern blot technique (See Figure 7). The two RFLP alleles 
detected are a fragment at 6.5 kb (allele) and a fragment at 3.5 kb 
(allele 2). Major constant fragment bands are seen at approximately 
15 2 and 5 kb with minor constant bands at 0.9 and 7.5 kb. Allele 1 is 
present in approximately 30% of the X chromosomes of the Caucasian 
population. Allele 2 is present in approximately 20% of the X 
chromosomes of the Caucasian population. In Figure 8 Lanes A B and 
D, DNA from women who are homozygous for allele 1 is shown. In 
Figure 8 Lane C. DNA from a woman who is heterozygous for both 
alleles 1 and 2 is shown. Figure 8 Lane E contains DNA from a man 
that only possesses allele 2. This RFLP, and others determined by 
the clones we have isolated, will enable one to monitor the androgen 
receptor gene in various disease conditions described herein. 

An example of using the androgen receptor clones to detect 
mutations is shown in Figure 8 where five different complete 
androgen insensitive patients' DNA are digested with EcoRl , 
electrophoresed on a Southern blot, and probed with cDNA probe B. 
The patient in lane B lacks a 3kb band indicating that part of the 
androgen receptor gene is deleted. Further analysis of this and 
other patients DNA is possible with other AR probes and by 
sequencing by standard methods and comparing the abnormal sequence 
to the normal sequence described herein. 
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Other potential uses for oligonucleotide sequences disclosed, 
for example in construction of therapeutics to block genetic 
expression, will be obvious to one of ordinary skill in the art. • 
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What is claimed is: 

1. A recombinant DNA molecule comprising a DNA sequence 
encoding the structural gene for androgen receptor protein. 

2. The recombinant ONA molecule of Claim 1 wherein the 
androgen receptor protein is a human androgen receptor protein. 

3. A cloning vehicle comprising a genomic DNA molecule which 
upon expression in a eukaryotic host produces androgen receptor 
protein. 

4. The cloning vehicle of Claim 3 wherein the androgen 
receptor protein is a human androgen receptor protein. 

5. An androgen receptor protein produced by translation^ 
the DNA sequence encoding androgen receptor protein in a host 
organism transfected or transformed by the cloning vehicle of Claim 

3. 

6. A human androgen receptor protein produced by translation 
of the DNA sequence encoding human androgen receptor protein in a 
host organism transfected or transformed by the cloning vehicle of 

Claim 4. 
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